NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Stratified Adversarial Robustness with Rejection

Chen, Jiefeng; Raghuram, Jayaram; Choi, Jihye; Wu, Xi; Liang, Yingyu; Jha, Somesh (July 2023, International Conference on Machine Learning)

Full Text Available
The Trade-off between Universality and Label Efficiency of Representations from Contrastive Learning

Shi, Zhenmei; Chen, Jiefeng; Li, Kunyang; Raghuram, Jayaram; Wu, Xi; Liang, Yingyu; Jha, Somesh (May 2023, International Conference on Learning Representations)

Full Text Available
GRAPHITE: Generating Automatic Physical Examples for Machine-Learning Attacks on Computer Vision Systems

Feng, Ryan; Mangaokar, Neal; Chen, Jiefeng; Fernandes, Earlence; Jha, Somesh; Prakash, Atul (June 2022, 7th IEEE European Symposium on Security and Privacy)

This paper investigates an adversary's ease of attack in generating adversarial examples for real-world scenarios. We address three key requirements for practical attacks for the real-world: 1) automatically constraining the size and shape of the attack so it can be applied with stickers, 2) transform-robustness, i.e., robustness of a attack to environmental physical variations such as viewpoint and lighting changes, and 3) supporting attacks in not only white-box, but also black-box hard-label scenarios, so that the adversary can attack proprietary models. In this work, we propose GRAPHITE, an efficient and general framework for generating attacks that satisfy the above three key requirements. GRAPHITE takes advantage of transform-robustness, a metric based on expectation over transforms (EoT), to automatically generate small masks and optimize with gradient-free optimization. GRAPHITE is also flexible as it can easily trade-off transform-robustness, perturbation size, and query count in black-box settings. On a GTSRB model in a hard-label black-box setting, we are able to find attacks on all possible 1,806 victim-target class pairs with averages of 77.8% transform-robustness, perturbation size of 16.63% of the victim images, and 126K queries per pair. For digital-only attacks where achieving transform-robustness is not a requirement, GRAPHITE is able to find successful small-patch attacks with an average of only 566 queries for 92.2% of victim-target pairs. GRAPHITE is also able to find successful attacks using perturbations that modify small areas of the input image against PatchGuard, a recently proposed defense against patch-based attacks.
more » « less
Full Text Available
Towards Evaluating the Robustness of Neural Networks Learned by Transduction

Chen, Jiefeng; Wu, Xi; Guo, Yang; Liang, Yingyu; Jha, Somesh (January 2022, International Conference on Learning Representations)

Full Text Available
Robust Attribution Regularization

Chen, Jiefeng; Wu, Xi; Rastogi, Vaibhav; Liang, Yingyu; Jha, Somesh (December 2019, Conference on Neural Information Processing Systems)

An emerging problem in trustworthy machine learning is to train models that pro- duce robust interpretations for their predictions. We take a step towards solving this problem through the lens of axiomatic attribution of neural networks. Our theory is grounded in the recent work, Integrated Gradients (IG) [STY17], in axiomatically attributing a neural network’s output change to its input change. We propose training objectives in classic robust optimization models to achieve robust IG attributions. Our objectives give principled generalizations of previous objectives designed for robust predictions, and they naturally degenerate to classic soft-margin training for one-layer neural networks. We also generalize previous theory and prove that the objectives for different robust optimization models are closely related. Experiments demonstrate the effectiveness of our method, and also point to intriguing problems which hint at the need for better optimization techniques or better neural network architectures for robust attribution training.
more » « less
Full Text Available

Search for: All records